National Repository of Grey Literature 7 records found  Search took 0.01 seconds. 
Semantic Analysis of Web Content
Hubl, Lukáš ; Rychlý, Marek (referee) ; Burget, Radek (advisor)
This work deals with the topics of semantic web, web page segmentation and technologies, which are used in this area. It also deals with a modification of one web page segmentation method, specifically DOM-based segmentation, using semantic web technologies. Thus, this work designs the way of web page segmentation based on semantic analysis of individual elements of the web pages content. An application that demonstrates the functionality of the designed segmentation method was also created within this work. With the implemented application, experiments were performed, whose results are also part of this work.
Page Segmentation in a Web Browser
Zubrik, Tomáš ; Polčák, Libor (referee) ; Burget, Radek (advisor)
This thesis deals with the web page segmentation in a web browser. The implementation of Box Clustering Segmentation (BCS) method in JavaScript using an automated browser was created. The actual implementation consists of two main steps, which are the box extraction (leaf DOM nodes) from the browser context and their subsequent clustering based on the similarity model defined in BCS. Main result of this thesis is a functional implementation of BCS method usable for web page segmentation. The evaluation of the functionality and accuracy of the implementation is based on a comparison with a reference implementation created in Java.
New Web Page Segmentation Methods
Malaník, Michal ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
The aim of this work is to introduce a new vision based web page segmentation method. This method is based on very popular VIPS segmentation algorithm, which is trying to represent the segmented web document in the same way as it is perceived by a user using a web browser. Compared to the VIPS algorithm, there are some optimizations for modern websites in our method, especially for documents created in the HTML 5 language. We also deal with the implementaion of the proposed method using the FITLayout framework.
Vision-based Web Page Segmentation
Maštera, František ; Hynek, Jiří (referee) ; Burget, Radek (advisor)
The FitLayout library offers a suite of implemented web page segmentation algorithms along with a number of tools for their evaluation and further development. The goal of this thesis is to extend this suite by another of already existing algorithms. To meet this goal, the Cormier et al. algorithm was chosen and integrated into the FitLayout. The plausibility of its implementation against its publication has been duly verified. Its extensive evaluation was also carried out to determine its properties and behaviour under different circumstances, which revealed algorithm settings that improve the quality of its outputs on the tested data sample by up to 9.89 %. As a result of this thesis, the FitLayout library has been extended with a new web page segmentation algorithm, which can be used in further research in this area that can be supported with the results found in this thesis.
Page Segmentation in a Web Browser
Zubrik, Tomáš ; Polčák, Libor (referee) ; Burget, Radek (advisor)
This thesis deals with the web page segmentation in a web browser. The implementation of Box Clustering Segmentation (BCS) method in JavaScript using an automated browser was created. The actual implementation consists of two main steps, which are the box extraction (leaf DOM nodes) from the browser context and their subsequent clustering based on the similarity model defined in BCS. Main result of this thesis is a functional implementation of BCS method usable for web page segmentation. The evaluation of the functionality and accuracy of the implementation is based on a comparison with a reference implementation created in Java.
Semantic Analysis of Web Content
Hubl, Lukáš ; Rychlý, Marek (referee) ; Burget, Radek (advisor)
This work deals with the topics of semantic web, web page segmentation and technologies, which are used in this area. It also deals with a modification of one web page segmentation method, specifically DOM-based segmentation, using semantic web technologies. Thus, this work designs the way of web page segmentation based on semantic analysis of individual elements of the web pages content. An application that demonstrates the functionality of the designed segmentation method was also created within this work. With the implemented application, experiments were performed, whose results are also part of this work.
New Web Page Segmentation Methods
Malaník, Michal ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
The aim of this work is to introduce a new vision based web page segmentation method. This method is based on very popular VIPS segmentation algorithm, which is trying to represent the segmented web document in the same way as it is perceived by a user using a web browser. Compared to the VIPS algorithm, there are some optimizations for modern websites in our method, especially for documents created in the HTML 5 language. We also deal with the implementaion of the proposed method using the FITLayout framework.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.